The Signature Molecular Descriptor. 2. Enumerating Molecules from Their Extended Valence Sequences
Abstract
We present a new algorithm that enumerates molecular structures matching a predefined extended valence sequence, or signature. The algorithm can construct molecular structures composed of about 50 non-hydrogen atoms on a time scale of CPU seconds. It is run to produce all molecular structures matching the binding affinities (IC50) of some HIV-1 protease inhibitors. The algorithm is also used to compute the degeneracy of a given signature, that is, the number of molecular structures corresponding to it. Signature degeneracy is systematically studied for varying signature heights on four molecular series: alkanes, alcohols, fullerene-type structures, and peptides. It is then compared with similar results obtained with popular topological indices (TIs). As a general rule, we find that signature degeneracy decreases as the signature height increases. We also find that alkanes, alcohols, and fullerene-type structures comprising n non-hydrogen atoms are uniquely characterized by signatures of height n/4, while peptides of up to 4000 amino acids can be singled out with signatures of heights as small as 2 and 3.
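To illustrate how degeneracy can fall as the signature height grows, the sketch below uses a simplified, hypothetical stand-in for the signature rather than the paper's actual definition or algorithm. It assumes an acyclic, hydrogen-suppressed skeleton, describes each atom by a canonical string of its neighborhood up to a given number of bonds, and takes the molecular signature to be the multiset of those strings. The function names and the two example skeletons (2-methylpentane and 3-methylpentane) are illustrative choices that do not come from the source.

```python
from collections import Counter


def build_adjacency(n_atoms, bonds):
    """Return an adjacency list for an undirected molecular graph."""
    adj = {i: [] for i in range(n_atoms)}
    for a, b in bonds:
        adj[a].append(b)
        adj[b].append(a)
    return adj


def atom_signature(adj, labels, atom, height, parent=None):
    """Canonical string for the neighborhood of `atom` up to `height` bonds.

    Simplified assumption: the graph is acyclic, so excluding the parent
    atom is enough to avoid walking back over a bond already traversed.
    """
    if height == 0:
        return labels[atom]
    children = sorted(
        atom_signature(adj, labels, nbr, height - 1, atom)
        for nbr in adj[atom]
        if nbr != parent
    )
    if not children:
        return labels[atom]
    return labels[atom] + "(" + "".join(children) + ")"


def molecular_signature(adj, labels, height):
    """Multiset of atom signatures, used here as the molecular signature."""
    return Counter(atom_signature(adj, labels, a, height) for a in adj)


# Hydrogen-suppressed carbon skeletons (hypothetical test inputs).
labels = ["C"] * 6
mp2 = build_adjacency(6, [(0, 1), (1, 2), (2, 3), (3, 4), (1, 5)])  # 2-methylpentane
mp3 = build_adjacency(6, [(0, 1), (1, 2), (2, 3), (3, 4), (2, 5)])  # 3-methylpentane

same_at_height_1 = molecular_signature(mp2, labels, 1) == molecular_signature(mp3, labels, 1)
same_at_height_2 = molecular_signature(mp2, labels, 2) == molecular_signature(mp3, labels, 2)
print(same_at_height_1, same_at_height_2)
```

Under these assumptions the two isomers share the same height-1 signature, so that signature is degenerate, while their height-2 signatures differ. This matches the general trend reported in the abstract, although the toy encoding makes no claim about the n/4 bound or about cyclic structures.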
Similar resources
The Signature Molecular Descriptor. 4. Canonizing Molecules Using Extended Valence Sequences
We present a new algorithm to canonize molecular graphs using the signature molecular descriptor introduced in the previous papers of this series. While developed specifically for molecular structures, the algorithm can be used for any graph and is not limited to acyclic graphs, planar graphs, bounded valence, or bounded genus graphs, for which polynomial time algorithms exist. The algorithm is...
A Novel Molecular Descriptor Derived from Weighted Line Graph
The Bertz indices, derived by counting the number of connecting edges of line graphs of a molecule were used in deriving the QSPR models for the physicochemical properties of alkanes. The inability of these indices to identify the hetero centre in a chemical compound restricted their applications to hydrocarbons only. In the present work, a novel molecular descriptor has been derived from the w...
Molecular Study of Mycobacterium avium-intracellular Complex Strains
It is difficult to distinguish between clinically significant slowly-growing, non-pigmented mycobacteria, notably to separate M. avium and M. intracellulare from one another and from M. scrofulaceum strains. The purpose of this study was to evaluate the extent to which 16S rRNA sequencing could be used to highlight the taxonomic relationships of the mycobacterial strains, which are difficult to ...
Biological Activity of Chemical Compounds and Their Molecular Structure-Information Approach
A method is proposed for bioscreening chemical compounds. We propose the systemic signs within the framework of an informational approach and a statistical method for comparing molecular qualitative characters. Using the methods of information theory, we offer four classification rules that allow preparations with high biological action (radioprotection) to be distinguished with statistical reliability. Fo...
Exact sequences of extended $d$-homology
In this article, we show the existence of certain exact sequences with respect to two homology theories, called d-homology and extended d-homology. We present sufficient conditions for the existence of a long exact extended d-homology sequence. We also give some illustrative examples.
Journal title:
- Journal of Chemical Information and Computer Sciences
Volume 43, Issue 3
Pages -
Publication date: 2003